REMARKS 

Enclosed herewith is a Substitute Specification in which 
the specification as filed has been amended in various places to 
correct typographical and grammatical errors, and also to add 
section headings. 

In support of the above, enclosed herewith is a copy of 
the specification as filed marked up with the above changes. 

The undersigned attorney asserts that no new matter has 
been incorporated into the Substitute Specification. 

The claims have been amended to more clearly define the 
invention as disclosed in the written description. In particular, 
the claims have been amended for clarity. 

Claim 3 has further been amended to clarify the expression 
"a text color is fewer present than a background color." In 
particular, claim 3 now recites "a number of picture units of the 
text color is fewer than a number of picture units of the 
background color." This finds support in the Substitute 
Specification on page 16, line 19 to page 17, line 17, (paragraphs 
[0045] - [0047] ) . ' J, 

Applicants believe that the above changes answer the y, ' 
Examiner's 35 U.S. C. 112, paragraph 2, rejection of claim 3',/^ 
respectfully request withdrawal thereof. 

/35 

The Examiner has rejected claims 1, 7 and 8 und 

AO to 

U.S.C. 102(e) as being anticipated by U.S. Patent 6, Of 



S : \GO\PF12GOD0 . GOR 



6 




Suzuki et al. The Examiner has further rejected claim 2 under 35 
U.S*C. 103(a) as being unpatentable over Suzuki et al. in view of 
U»S. Patent 5,956,468 to Ancin, Moreover, the Examiner has rejected 
claim 3 under 35 U.S.C. 103(a) as being unpatentable over Suzuki et 
al. in view of U.S. Patent 6,038,340 to Ancin et al. and further in 
view of Ancin. In addition, the Examiner has rejected claim 4 under 
35 U.S.C. 103(a) as being unpatentable over Suzuki et al. in view 
of U.S. Patent 6,148,102 to Stolin. Furthermore, the Examiner has 
rejected claim 5 under 35 U.S.C. 103(a) as being unpatentable over 
Suzuki et al. in view of the article "Thresholding and Enhancement 
of Text Images for Character Recognition" by W.W. Cindy Jiang, and 
further in view of U.S. Patent 5,781,658 to 0' Gorman. Finally, the 
Examiner has rejected claim 6 under 35 U.S.C. 103(a) as being 
unpatentable over Suzuki et al. in view of the Jiang article. 

The Suzuki et al. patent discloses a display control 
apparatus having a display of text data and image data and a 
display control method for the same, in which in an signal it is 
detected whether a region is or is not a text region. If the region 
is a text region, scaling is performed with a scale down percentage 
of 66%, and if the region is not a text region, scaling is 
performed with a scale down percentage of 50%. 

The subject invention relates to a process for improving 
text in a signal. To that end, the signal is analyzed to detect 
text regions (as in Suzuki et al.). However, in the subject 
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invention, the entire signal is then subjected to scaling in order 
to match the signal with the display. Then, the scaled signal is 
post processed in dependence on the detected text regions. 

Applicants submit that Suzuki et al. does not show or 
suggest these steps of the inventive method, i.e., scaling the 
entire signal after detection of the text regions, and then post- 
processing the scaled signal in dependence on the previously 
detected text regions. 

The Ancin patent discloses a document segmentation system 
in which a signal having both text and image regions is analyzed. 
The text regions are detected (i.e., "The method and apparatus then 
detect dark text pixels on a light background region of a document 
and assign segmentation labels to each pixel.") As further stated 
in the Abstract, "The pixel labels are post-processed using a 
plurality of syntactic rules to correct mislabeling of pixels." 
Further, "Pixels identified as being in the background region of 
the document are assigned a white label and pixels identified as 
being in the text region are assigned a black label." 

Applicants submit that Ancin only changes the text to 
black and the background to white. Ancin neither shows nor suggests 
that this should be done in a pre-scaling step, and that in a post- 
scaling step, the text and background should be returned to their 
previous colors. 
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Furthermore, Applicants submit that Ancin does not supply 
that which is missing from Suzuki et al., i.e., scaling the entire 
signal after detection of the text regions, and then post- 
processing the scaled signal in dependence on the previously 
detected text regions. 

The Ancin et al. patent discloses a system and method for 
detecting the black and white points of a color image, in which, in 
a particular region, the number of black pixels is compared to the 
number of white pixels to determine whether that region is a text 
region. However, Applicants submit that Ancin et al. or Ancin, 
either individually or collectively, do not supply that which is 
missing from Suzuki et al», i.e., scaling the entire signal after 
detection of the text regions, and then post-processing the scaled 
signal in dependence on the previously detected text regions. 

The Stolin patent discloses recognizing text in a 
multicolor image, in which, in detecting text, "the image is 
separated into multiple blocks. Color distribution of each of the 
blocks are analyzed, and blocks having two main colors are 
identified. The two-color blocks having similar colors are grouped 
into two-color zones, and text in the two-color zones are 
identified." However, Applicants submit that Stolin does not supply 
that which is missing from Suzuki et al., i.e., scaling the entire 
signal after detection of the text regions, and then post- 
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processing the scaled signal in dependence on the previously 
detected text regions. 

The Jiang article discloses "Each sub-pixel in the 
interpolated graytone image has an associated threshold ... If a sub- 
pixel intensity is larger than its threshold, binarize it into a 
black sub-pixel; otherwise make it a white sub-pixel However, as 
Applicants noted above with respect to the other references, Jiang 
does not supply that which is missing from Suzuki et al., i.e., 
scaling the entire signal after detection of the text regions, and 
then post-processing the scaled signal in dependence on the 
previously detected text regions. 

In view of the above. Applicants believe that the subject 
invention, as claimed, is neither anticipated nor rendered obvious 
by the prior art, either individually or collectively, and as such, 
is patentable thereover. 
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Applicants believe that this application, containing 
claims 1-8, is now in condition for allowance and such action is 
respectfully requested. 

Respectfully submitted, 




by-^ ■ 

Edward W. GoddmaHi:; Regr 28,613 
Attorney 

Tel.: 914-333-9611 

CERTIFICATE OF MAILING 

It is hereby certified that this correspondence is being 
deposited with the United States Postal Service as First-class mail 
in an envelope addressed to: 

COMMISSIONER FOR PATENTS 
P.O. Box 1450 
Alexandria, VA 22313-1450 



On U 1^ e ^^oT 

^ 

By %/U-''<'-f\ii^ /J'O^^ 
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RECEIVED 

TEXT IMPROVEiVENT 

JUN 2 0 2003 
Technology Center 2600 

BACKGROUND OF THE INVENTION 
Field Of The Invention 

[0001] The invention relates to a method and device for text 
improvement . 

5 

Description Of The Related Art 

[0002] The article "Thresholding and enhancement of text images 
for character recognition", by W.W. Cindy Jiang, IEEE, Proceedings 
of the international conference on acoustics, speech, and signal 

10 processing (ICASSP) , NY, vol. 20, 1995, pp. 2395-2398, discloses a 
scheme which converts graytone text images of low spatial 
resolution to bi- level images of higher spatial resolution for 
character recognition. A variable thresholding technique and 
morphological filtering are used. It is stated that most optical 

15 character recognition systems perform binarization of inputs before 
attempting recognition, and that text images are usually supposed 
to be binary. 

[0003] The article "A segmentation method for composite 
text/graphics (halftone and continuous tone photographs) 
20 documents", by S. Ochuchi et al . , Systems and Computers in Japan, 
Vol. 24, No. 2, 1993, pp. 35-44, discloses that when processing 
composite documents for digital copy machines and facsimile which 
contain a mixture of text, halftone and continuous tone 
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photographs, ideally, the text portion can be separated from the 
graphics portion and more efficiently represented than the multi- 
bit pixel bitmap graphics representation. 

[0004] NowadaySj_ digital display devices are more and more 
5 frequently matrix devices, e.g.j_ Liquid Crystal Displays, where 

each pixel is mapped on a location of the screen having a one— to^ 
one relationship between raster data and display-^-e points. This 
technology implies the usage of a scaling system to change the 
format of the input video/graphic signal so that it satisfies the 

10 size of the device, i,e.j_ the number of its pixels. The scaling 

block is based on a filter bank that performs pixel interpolation 
when the zooming factor is varying. Actually available solutions on 
the market apply an undifferentiated processing on the graphic 
raster that leads to results with unavoidable artifacts. Usually^ 

15 low-pass filters reduce pixellation, also know as the seesaw 
effect^ on diagonals, and prevent the signal to cuff e r from 
suffering from aliasing due to the sub- sampling, but they also 
introduce other annoying effectSj_ such as_L blurring the images. It 
depends on the content of the displayed signalj_ the relevance of 

20 the perceived artifacts and the kind of artifacts that have to be 
preferred as unavoidable. 



SUMMARY OF THE IIWENTION 
[0005] It is, inter alia, an object of the invention to provide 
25 a simple text improvement for use with displays that require a 
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scaling operation. To thic e nd th e inv e ntion providoc a t e xt 
improvem e nt ac d e fin e d in th e indep e nd e nt claims. Advantag e s 
ombodim e ntc ar e d e fin e d in th e dep e nd e nt claim s . 

[0006] Starting from the above-mentioned observations, a novel 
5 technique is provided here that is able to take into acco\int the 
image content and to apply an ad hoc post -processing only where it 
is required. So Hence , in accordance with the present invention, 
text improvement after the scaling operation is based on text 
detection before the scaling operation. The processing is active 
10 only in presence of text region. A viable area of application of 
this invention is the text readability improvement in the case of 
LCD devices, when, and it is usually the case, we do not want to 
affect other parts of the displayed signal. 

[0007] A remarkable characteristic of the technique presented 
15 here is its really low computational complexity. This aspect 
determines a high effectiveness in terms of cost /performances 
ratio. In fact^ the insertion of the proposed algorithm into the 
other circuitry that carries out all the digital processing needed 
for r e cia e resizing the matrix display device input, presumably 
20 ris es raises the display quality, according with the average user 
perception, without affocting considerable affecting its cost. 
[0008] It is noted that while in one embodiment, a binarization 
takes place, this binarization is only carried out in regions where 
text has been detected, while in the prior art, the binarization is 
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a preliminary step to be carried out before characters can be 
recognized. 

[0009] These and other aspect of the invention will be apparent 
from and elucidated with reference to the embodiments described 
5 hereinafter . 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0010] In the drawings : 

[0011] Figs. 1-3 illustrate the operation of a morphological 
10 filter; and 

[0012] Fig. 4 shows a block diagram of a system in accordance 
with the present invention. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
15 [0013] The invention proposes the design of a text detection 
algorithm, together with a post-processing block, for text 
enhancement. It will be shown that the invention significantly 
improves the performance in terms of content readability and leads 
to good perceptual results of the whole displayed signal, while 
20 keeping really low the computational complexity of the total 
scaling system. 

[0014] The organization of the remainder of this document is as 
follows. First the general scaling problem and the current 
available algorithms are briefly summarized. Thereafter concepts 
25 concerning format conversion by a non- integer factor will be 
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introduced. Successivelyj_ the post -processing block, characterized 
by the thresholding operation and the morphological filter, will be 
summarized and its features will be described. Finally the text 
search strategy will be presented and the detection algorithm and 
5 its cooperation with the previously introduced post -processing 
block will be elucidated. 

The g e n e ral f ram e work General Framework 

[0015] Resizing pictures into a different scale requires format 
10 conversion. This operation involves well-known re-sampling theoryj_ 
and classical filter procedures are currently used to accomplish 
with it. Filters avoid aliasing problems in frequency, freeing room 
for the repetitions introduced by the sampling operation in the 
original domain. Among the interpolation filter familieSj_ 
15 polynomial interpolators of first order are commonly usedj_ in which 
the reconstructed pixel is a weighted mean of the nearest pixels 
values. These kinds of filters are also called Finite Impulse 
Response (F.I.R.) filters. 

[0016] Inside standard display devices_^ the format conversion 
20 problem is usually faced with linear filtering too. A particularly 
simple class of F.I.R. filters reconstructs pixels in between two 
available ones tacking the value on the line joining these two 
adjacent points. 

[0017] There are many other possible techniques. For example^ 

25 pixel repetition or polynomial interpolation with more complexes 
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weighting functions. The quality perception of images processed 
with these different solutions is generally not really high, there 
are impairments and artifacts that are not completely avoidable. 
This consideration implies that some compromise is due in order to 
5 reach an acceptable or, at th e best, a satisfactory 
cost/performance ratio. 

[0018] In the past^ the simplest solution solved the problem 
using pixel repetition. A more recent solution, see the Philips 
scaler PS6721, still uses linear filtering but with a slight 
10 different shape of the impulse response, to improve the transition 
steepness . 

[0019] Measuring the rise time of the step response is a 
classical way to assess the performance of the interpolator in 
presence of an edge. In factj_ low low-p ass filters affect edge 
15 steepness and a smooth steepness is perceived as a blurring effect. 
[0020] Moreoverj_ the actual impact of this annoying artifact 
depends on the kind of displayed signal. Actuallyj_ in case of 
natural imageSj_ a blurring effect could be tolerated in a certain 
measure . 

20 [0021] WhereaSj_ for artificial patterns, a slight smootlmess 

effect is recommended only when the content requires approaching a 
natural impression (this is the case of virtual reality and 3D 
games) . In this casej_ filtering is used as an anti-aliasing 
process. For the same reason j_ these kinds of filters are used on 

25 text/characters to avoid the pixellation effect, also know as the 
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seesaw effect^ on diagonals. Interpolation filters are anti- 
aliasing filters too, because they reduce the highest frequency of 
the input signal. Moreover, supposing to have a black text on a 
white background, the amount of gray levels introduced by this kind 
5 of filters should be a less percentage of the black quote. If it is 
not the case, we have an artifact instead of a picture improvement, 
and images are perceived as blurred. For instance, when bilinear 
interpolation as well more sophisticated filterSj_ like the bicubic 
ones, are used on small characters (the commonly used size 10-rl2 
10 points) and thin lines, they appear defocused. In all these cases^ 
it seems better to use no filters at all, at least no low — low-p ass 
filters as are actually available. 

[0022] Starting from the above considerationj_ we can conclude 
that, because format conversion requires resampling, so that the 

15 filtering process is \inavoidable, to accomplish with the above 

issue we have to find out some other solution. In case of text, a 
simple idea is to apply a post-processing block after the scaler to 
clean all the gray levels where characters are detected. Because of 
the scale change, this operation could not be performed using only 

20 a simple threshold block. In factj_ threshold is a non- linear 

operator that introduces fiote ^non- unif orm patterns when it converts 
gray levels characters to binary values, another kind of artifact 
that is highly noticeable. Morphological filters are an interesting 
class of operators that are able to change not non- regular patterns 
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into more regular ones. They will be introduced in a following 
section. 

Format conv e rcion by a rational f actor Conversion By A Rational 
5 Factor 

[0023] In today's digital display devices, images are frequently 
represented with a matrix of pixels so that recpai ring a fixed 
picture format io r e quir e d . When a signal with a different format 
arrives at the input of a matrix display, format conversion is 

10 unavoidable. Supposing If a graphic card had generated the signal, 
than then selecting a different graphic format, instead of the one 
used by the display, depends on the requirement of the software 
application running. At the moment j_ it is not advisable to 
constraint the graphic card output only with the requirement of the 

15 display. 

[0024] We recall that standard today' s graphic formats are VGA, 
SVGA, XGA, SXGA and higher. Format conversion between these raster 
sizes requireSj_ in almost all th e caseSj_ a rescaling by a rational 
factor. This, by itself, leads to a sensible degradation of the 
20 resampled picture. In fact when, for example, we need to change 

format from VGA at display's input to XGA at display's output, the 
factor involved would be 8/5, equal to 1.6 times the size of the 
original picture. This format conversion ratio would clearly 
require a sub-pixel resolution, but with standard linear filtering 
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techniques^ this is not bo allow e d possible without paying a high 
blurring cost . 

[0025] Let s(i,j) be the input signal at position in the 

input gridj_ and s(i J) be the signal after format conversion at 
5 position (ij) in the thicker output grid. Supposing to hav e In the 
case of a rescaling from VGA to XGA, i.e.^ by an 8/5 factor, for 
every 5 pixels at the input of the sampler there will be 8 pixels 
at its output. A rescaling by a rational factor conceptually relies 
on an intermediate "super- resolution" grid obtained using a zooming 

10 factor equal to the numerator, in the example 8. In this case^ the 
"super- resolution" grid will be eight times thicker than the 
original one. Tacking two input values, s(i,j) and s(i + \^j), on the 
same line j at position / and / + the interpolated value s{i,j) 
will be positioned in between the two original values in the super- 

15 resolution gird, i.e.j_ in one of the eight possible positions 

available on the grid. W e e xpr e ss this fact This fact is expressed 
by the following equation: 

k r 1 

sii + - , y) = >v, • s(i, j) + • s(i -f 1, j) € [0 ... 7 J 

o 

20 

Where, for the linear interpolatorj_-r 

Ul = S 
\w2 = \-S 
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k is the position of the pixel in the dense grid, the position 
also being called filter phase. In a linear filter , the position is 
Sock, and S is the distance between the pixel to be interpolated 
with respect to the two adjacent original ones. The signal at the 
5 output grid will be obtained tacking values on a grid 5 times 
weaker. The s\ib- sampled signal at the output is expressed as—L^ 
follows : 



10 

Because the output grid is not a multiple of the input grid, often 
original pixel values will be lost and they will be replaced with 
an average value according with what w e s aid to the above. If the 
input pattern would b e is a black and white text, its pixels will be 
15 frequently replaced by a weighted average of their values, a gray 
level . 

Text improv e m e nt via thr e sholding lmprovement Via Thresholding 
[0026] A threshold operator placed at the output of the scaling 
20 filter will recover a black and white pattern, orj_ more generally^^ 
a bicolor one, choosing the threshold nearest value according to 
the following relationship: 



25 



/. if s\i^^^kJ)>S 
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Where is the black level and is the white ©nelevel. 
[0027] W e could notic e lt should be noted that, in case of 
black/white and bicolor patterns, the threshold function could be 
integrated in the filter operator, setting and in accordance 
5 with the actual filter phase. In this way^ the threshold operation 
recovers original bicolor levels from the interpolated ones 
according with theirs new positions. In regions where the amount of 
gray levels introduced is too h e ight high , this simple operator 
improves the sharp edge perception. Anyway^^ this is paid with the 
10 introduction of irregular patterns. In the next sectionj_ w e will 
s ee how a solution to this problem could b e s olv e d is presented . 

Morphological filt e ring algorithms Filtering Algorithms 
[0028] The introduction of mathematical morphology to solve the 

15 problem of text deblurring is due to the fact that a morphological 
filter, working both as a detector and as a non- linear operator, is 
able to eliminate gray levels without destroying the character 
regularity. Moreover, in case of bicolor patterns, a morphological 
filter is able to recover a specified regularity where required. 

20 [0029] In generalj_ the detector, called structuring element, is 

a small matrix (usually 2x2 or 3x3)-; — 3r fe. It can recognize a 
particular pattern on the data, in our case^ the rasterized image's 
pixels at the display output, and to— substitute that pattern with a 
different set of requested values. 
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Supposing to us e [0030] When the morphological filter is 

used after the threshold block, on a bi- level pattern, the 
structuring element will work as a binary mask on the underlying 
data^ performing a set of logical operations between the bit of the 
running matrix and the bit of the scanned data. An output equal at 
to 1 will signify that a specified pattern has been identified. 
[0031] A particular operator belonging to the morphological 
filter family, also called "diagonal" filter, applies the following 
set of logical operations to the data: 



P^ =iX',nX,nX',nX,) 

P3 =(XlnX,nX'^nX^) 

P,=iXlnX,nX',nX,) 
Here, X^-'X^ is the set of data currently analyzed by the 

structuring element ; b es id es . In addition , in case of binary data, 

15 u is the classic logical OR operator and n is the classic logical 

AND operator. The structuring element orders the data in its 

framework as shown in Fig. 1. 

[0032] The output, y, after the set of logical operations abov e 
introduce d above , replaces the previous value at the origin of the 
20 data matrix, X^ in th e figur e Fig. 1 . On e can not ic e It should be 
noted that, if the result of P^kjP^kjP^uP^ is 0, than then X^ 
remains unchanged, if, instead, if— the result if 1 tha n , then X^ is 
always replaced by 1 . 

[0033] Looking carefully, it will be evident that the set P^ , 
25 P2 , P3 , of logical operations corresponds to the detection of 
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the patterns shown in Fig. 2. Patterns in Fig. 2 are diagonal 
patterns of black and white pixels, in case of binary images. 
According with in accordance with the above relations, when one of 
these configurations is found, ^ba Rthen , in the origin of the 
5 detected region identified by a circle in th e figur e Fig. 2 , a 0 
value is substitute by a 1. This operation, in terms of pattern 
effect, fills holes in diagonal configurations, 
[0034] On e could notic e lt should be noted that the same 
operation could be done, instead of using logical operators, with a 
10 LUT addressed by the configuration of bits in the structuring 
element. Supposing to ord e r When the cells of the element are 
ordered according with th e abov e figur e to Fig. 2 , this 
configuration has the following address: 

where each X. is correspondingly equal at 1 or 0 according with the 
value in the i^^ position of the matrix. To fill holes, the LUT at 
position XXXIQXQXX , OIXIQXXXX , XIQXOIXXX , XXXXQXXIO , will be set 
20 at 1, in all of the other positions, it will be set at 0. Here X 
means Mon • t care_[]^ . 

[0035] From a conceptual point of view, because of the holes 
filling function of the "diagonal" structuring element, the set of 
operations described above on a structuring element, are equivalent 
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a^ to changing a diagonal patterns, anyhow oriented in a matrix, 
with a uniform block. This concept is clarified in Fig. 3. 

Block diagram of s yst e m e mbodim e nt Piagram Of System Embodiment 
5 [0036] Th e drawing in Fig. 4 shows a block diagram of the total 
system in which the main concept of the architecture for the 
detector and the post p ost -p rocessing block are sk e tch e d drawn . An 
input image Inim s is applied to a Search Window part SW and Text 
Detector part Det. The input image Inim s, possibly modified in 
10 some region by the text detector part Det, is applied to a scaler 
Seal, if recognized as text by the text detector part Det, such as 
the commercially available scaler PS6721. The scaled image from the 
scaler Seal is applied to a post-processing part Post-proc that 
produces the output image Outim s* . 

15 

Search Window and Text Detector 

[0037] The search window and text detector is a key operator. In 
fact_^ it depends on it^if the input signal will be binarized and 
further processed or simply filtered with the linear scaler. 
2 0 According to what i^was previously «a4rdstated, detection is 

specifically designed to recognize text patterns. When the required 
constraint imposed at the detector are not satisfied, the signal 
does not eventually benefit of this from further processing step. 
Detection is performed with a local sensor that recognizes the 
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amount of colors in a small region. SOj_ in principle^ it works as a 
search window that scan the raster image to discover text areas. 
[0038] To design it, satisfying a low memory cost, a fixed 
vertical width was used, equal to 3 lines on the domain of the 
5 original signal. Instead_^ its horizontal depth is varying varies 

according to the image characteristics^^ and it is based on a simple 
growth criterion defined using some intuitive assumptions on the 
text properties. Currently assumptions on graphic text are as— it 
follows : 

10 

[0039] l._ text area is a two-color region in which text 

is one color and the other color is the background. 

[0040] 2. ^_In a text area^ a text color is perceptually 

fewer present than a background color. 

15 [0041] 3. text region has a reasonable horizontal 

extension. 

[0042] These assumptions determine the constraints on the 
patterns the detector recognizes as text regions. As we— ca n be 

20 seen, neither filtered text nor not-uniform backgroimd are 

recognized as text regions. This is a reasonable assumption because 
the threshold operator in these cases would introduce more 
artifacts than benefits. Furthermorej_ the not^imbalanced percentage 
of the two colors prevents the detector from identifying^^ as textj_ 

25 two color regions with potentially dangerous patterns. An example 
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is the chess pattern, quite recurrent, for examplej_ in window 
folder background. Finallyj_ the third condition prevents ^ 
id e nt i f y ident i f ying as text regions small bicolor fragments of the 
raster signal, that could be— presumably be border or other small 
5 pieces of graphic objects. 

[0043] The conditions introduced above are used to define some 
parameters to adjust the behavior of the detector such that it 
could may reach the best performances. Let w e c Consider now the 
above introduced input raster signal s(r^c) at position (r,c) . The 

10 search window will be indicated with q{r,c) , with (r,c) being the 

coordinates of the block's origin that identify its reference pixel 
in the image; the relative coordinates, identifying a cell in the 
search window, are referred to the block origin and they will be 
noted as . Furthermore j_ the detector height and width will be 

15 indicated with h and w . Wh e r e as While w is a varying parameter, 
on the contraryj_ h is fixed, to satisfy line memory constraints, 
and its value is currently h = h = 3. 

[0044] Let be-iV^ be the number of colors detected in the search 
window. According with to the previously described block growing, 
20 the width w will increase following this search strategy: 

^w{k + l)^w{k) + l if N^<2 
^ w{k + \)^w{k) = w if N^>2 
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N^>2 is the exit condition from the growing search strategy. When 
the exit condition is verified, the system will return the final 
block width w . 

[0045] Together with the block growing process, two color 
counters will be incremented at each new step k . It could should be 
notic e noted that a step k corresponds to the evaluation of a new 
input pixel in the horizontal direction. Calling the number of 
pixels with color and ^2 number of pixels with color , the 

counters will be upgraded according in accordance with the 
corresponding block growing step in the following manner: 

/,(r + l) = /,(r) + l // ^(/ + H<^ + lX7 + A) = Ci for /r = 1...3 
7i (r + 1) = 7, (r) otherwise 

and 

/2(r4-l) = 72(^) + l if qii-^w(k'\-l),J + h) = c, for /z = 1...3 
^2 (r + 1) = ^2 (^) Otherwise 

r = 3- w(A: + l)-h A is a new counting step in the search window, a new 
pixel evaluated using the growing window at item k . 
[0046] Finall y l e t w e introduc e ^ ^ the last parameter ^ , 
representing the ratio between the two colors counters, is 
introduced once the background is identified, according to the 
following relationship : 
y 

^ = — // /x-yi ^ ^\ ~ background 
Y 

^ = — if Y\<'Yi ^2 = background 
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[0047] Once the algorithm is exited from the search strategy, 
the detection window is available to identify its content. 
[0048] As mentioned above, a first condition to be satisfied, so 
5 that a region would can be recognized as text, is that the block 
has a reasonable extension. Let— be: 



e = min w 



10 the minimum value, in terms of pixels, allowed for a region to be 
recognized as text region. The condition to be satisfied by a text 
region will be: 



15 

The current value fixed for the parameter £ is: ^ = 300. 
[0049] Recalling that ^ is the ratio between the background and 
the text colors, a second condition to be satisfied, so that the 
block would be recognized as a text areaj_ will be: 

20 



where ^ is a modifiable parameter actually fixed as ^ = 1.2. In 
other words: 

25 
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if ^ <^ qW not a text window 

[0050] The block will be discarded as not a text block when one 
of the above conditions are not satisfied. The new search window 
5 will be ^(r,c-fw) and it will start at position (r^c + w) in the 

original image, or (r + 3,c) depending if^ in the previous stepj_ the 
end of line was reached. 

[0051] Following this strategyj_ the entire image will be scanned 
by the search window and text region will be detected. As text is 
10 detected^ the previously described post -processing operations will 
be applied. 

[0052] Going back to Fig. 4, an input image is first subjected 
to a block-growing process BlGr based on whether the number of 
different colors does not exceed 2 (N^ < 2) , a first indication for 

15 the presence of text. As soon as the number of colors exceeds 2, 

the block growing process BlGr is stopped, and the other parameters 
Outpar are determined, which represent the three criteria for text 
listed above. Based on these parameters Outpar, it is determined 
whether there is a text region (Txt reg ?) . If so, the backgroxind 

20 color Cbackground IS Set to whlte , and the text color Ctext is set to 
black. 

[0053] The resulting image is subjected to the scaling operation 
SCAL. 

[0054] After the scaling operation SCAL, the text region is 
25 subjected to a thresholding operation (threshold , the output of 
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which is applied to a morphological filter (Morph. Filt.). 
Thereafter, white is set back to the backgroiind color Cbackground/ s-^-d 
black is set back to the text color c^ext- The result of this 
operation forms the output image Outim s* that is displayed on a 
5 matrix display D. 

[0055] A primary aspect of the invention can be summarized as 
follows. A novel technique is suggested which is able to take into 
account the image content and to apply an ad-hoc scaler post- 
processing only where it is requested. A viable area of application 

10 of this invention is the text readability improvement in the case 
of LCD devices, when, and it as is usually the case, we do not want 
to affect other part of the displayed signal. It is, inter alia, an 
object of the invention to provide an ad hoc simple text detector. 
The invention proposes the design of a text detection algorithm, 

15 together with a post-processing block, for text enhancement. The 
invention significantly improves the performance in terms of 
content readability and leads to good perceptual results of the 
whole displayed signal, while keeping really low the computational 
complexity of the total scaling system. The invention is preferably 

20 applied in LCD scaler ICs . 

[0056] It should be noted that the above-mentioned embodiments 
illustrate rather than limit the invention, and that those skilled 
in the art will be able to design many alternative embodiments 
without departing from the scope of the appended claims. In the 

25 claims, any reference signs placed between parentheses shall not be 
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construed as limiting the claim. The word "comprising" does not 
exclude the presence of elements or steps other than those listed 
in a claim. The word "a" or "an" preceding an element does not 
exclude the presence of a plurality of such elements. The invention 
can be implemented by means of hardware comprising several distinct 
elements, and by means of a suitably programmed computer. In the 
device claim enumerating several means, several of these means can 
be embodied by one and the same item of hardware. The mere fact 
that certain measures are recited in mutually different dependent 
claims does not indicate that a combination of these measures 
cannot be used to advantage. 
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ABSTRACT OF THE DISCLOSURE- i- 

In a method of text improvement, in an image text in an 
image is detected (SW, Det) , the image is scaled (Seal) to adjust 
first numbers of pixels per line and lines per image of the image 
to second numbers of pixels per line and lines per image that fit 
in with a display (D) on which the image is to be displayed, and 
the image is processed (Post-proc) in dependence on a result of the 
text detection. 



(Fig. 4) 
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